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Abstract. Let K be a totally real number field with Galois closure L. We 
prove that if / G Q[xi ,x n ] is a sum of m squares in K[x\, . . . , x n ], then / 
is a sum of 

ta . 2 W( [i: « 1 + 1 ) 

squares in Q[xi, . . . , x n ]. Moreover, our argument is constructive and gen- 
eralizes to the case of commutative /("-algebras. This result gives a partial 
resolution to a question of Sturmfels on the algebraic degree of certain semi- 
definite programing problems. 



1. Introduction 

In recent years, techniques from semidefinite programming have produced nu- 
merical algorithms for finding representations of positive semidefinite polynomials 
as sums of squares. These algorithms have many applications in optimization, con- 
trol theory, quadratic programming, and matrix analysis [HI HU EH [22l [23] . For a 
noncommutative application of these techniques to a famous trace conjecture, see 
the papers [TJ [5J [T21 [TB] which continue on the work of [9] • 

One major drawback with these algorithms is that their output is, in general, 
numerical. For many applications, however, exact polynomial identities are needed. 
In this regard, Sturmfels has asked whether a representation with real coefficients 
implies one over the rationals. 

Question 1.1 (Sturmfels). If f € Q[xi, . . . , x n ] is a sum of squares in M.[x±, . . . , x n ], 
then is f also a sum of squares in Q[xi, . . . , x n ] ? 

It is well-known that a polynomial is a sum of real polynomial squares if and 
only if it can be written in the form 

(1.1) / = v T i?v, 

in which v is a column vector of monomials and B is a real positive semidefinite 
(square) matrix [55]; in this case, the matrix B is called a Gram matrix for /. If B 
happens to have rational entries, then / is a sum of squares in Q[xi, . . . , x n ] (this 
follows from a Cholesky factorization argument or from a matrix generalization of 
Lagrange's four square theorem [10]). Thus, in the language of quadratic forms, 
Sturmfels is asking whether the existence of a positive semidefinite Gram matrix 
for / € Q[xi, . . . , x n ] over the reals implies that one exists over the rationals. 
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Although a complete answer to Qucstion ll.ll is not known, Parrilo and Peyrl have 
written an implementation of SOSTOOLS in the algebra package Macaulay 2 that 
attempts to find rational representations of polynomials that are sums of squares 
[20] . Their idea is to approximate a real Gram matrix B with rational numbers 
and then project back to the linear space of solutions governed by equation (jl.ip . 

The following result says that Question ll.ll has a positive answer in some "generic" 
sense; it also explains the difficulty of finding counterexamples. 

Theorem 1.2. Let f G Q[xi, ■ ■ ■ , x n \. If there is an invertible Gram matrix B for 
f , then there is a Gram matrix for f with rational entries. 

Proof. Let B be a real positive semidefinite matrix and v a vector of monomials 
such that / = v T Bv. Consider the set L of real symmetric matrices S = S T = (sy ) 
of the same size as B for which / = v T SV. This space corresponds to the solutions 
of a set of linear equations in the over Q. From elementary linear algebra 
(Gaussian elimination), it follows that there is an integer k such that 

L = {S Q + hSi + ■■■ + t k S k :ti,...,t k €R} 

for some rational symmetric matrices Si , . . . , S k ■ The subset of matrices in L that 
are positive definite is determined by a finite set of strict polynomial inequalities 
in the t\, . . . , t k produced by setting all the leading principal minors to be positive 
[Til p. 404]. By continuity, a real positive definite solution B guarantees a rational 
one, and this completes the proof. □ 

Remark 1.3. The argument above shows that we may find a rational Gram matrix 
of the same size as the original Gram matrix B. Is this true even if B is not 
invertible? We suspect not. 

Although the general case seems difficult, Question ll.ll has a positive answer for 
univariate polynomials due to results of Landau [15] , Pourchet [24] , and (algorith- 
mically) Schweighofer [29]. In fact, Pourchet has shown that at most 5 polynomial 
squares in Q[x] are needed to represent every positive semidefinite polynomial in 
Q[x], and this is best possible. 

It follows from Artin's solution to Hilbert's 17th problem [351 Theorem 2.1.12] 
that if / € Q[xi, . . . , x n ) is a sum of squares of rational functions in R(xi, . . . , x n ), 
then it is a sum of squares in Q,(x%, . . . , x n ). Moreover, from the work of Voevodsky 
on the Milnor conjectures, it is known that 2™ +2 such squares suffice [21 p. 530]. 
However, the transition from rational functions to polynomials is often a very del- 
icate one. For instance, not every polynomial that is a sum of squares of rational 
functions is a sum of squares of polynomials [21 p. 398] . 

More generally, Sturmfels is interested in the algebraic degree [17] of maximizing 
a linear functional over the space of all sum of squares representations of a given 
polynomial that is a sum of squares. In the special case of Question 11.11 a positive 
answer signifies an algebraic degree of 1 for this optimization problem. 

General theory (for instance, Tarski's Transfer Principle for real closed fields [2"o1 
Theorem 2.1.10]) reduces Question 1 1.1 1 to one involving real algebraic numbers. In 
this paper, we present a positive answer to this question for a special class of fields. 

Recall that a totally real number field is a finite algebraic extension of Q all of 
whose complex embeddings lie entirely in K. For instance, the field Q(vu) is totally 
real for positive, integral d. Our main theorem is the following. 
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Theorem 1.4. Let K be a totally real number field with Galois closure L. If 
f € . . . ,x n ] is a sum of m squares in K[x\, . . . ,x n ], then f is a sum of 

squares in Q[x%, . . . ,x n ]. 

Our techniques also generalize naturally to the following situation. Let R be a 
commutative Q-algebra and let K be a. totally real number field. Also, set S := 
R £g)Q K, which we naturally identify as a ring extension of R — R <E)q Q. If / is a 
sum of the form 

m 

j=i 

then we say that / is a sum of squares over R. It is a difficult problem to determine 
those / which are sums of squares over R. In this setting, Theorem 1 1 . 41 generalizes 
in the following way. 

Theorem 1.5. Let K be a totally real number field with Galois closure L. If f € R 
is a sum of m squares in R®qK, then f is a sum of Am - 2^ L:< ^ +1 (^'"f +1 ) squares 
over R. 

Remark 1.6. One can view Theorem 1 1.5 1 as a "going-down" theorem j4] for certain 
quadratic forms over the rings R and R ®q K. We do not know how much the 
factor 2[ i: "21 +1 ([ L:(! | +1 ) can be improved upon, although we suspect that for poly- 
nomial rings, it can be improved substantially. We remark that it is known [2] that 
arbitrarily large numbers of squares are necessary to represent any sum of squares 
over R[xi, . . . ,x n ], n > 1, making a fixed bound (for a given n) as in the rational 
function case impossible. 

Our proof of Theorem ll.4l is also constructive. 

Example 1.7. Consider the polynomial 

f = 3 - 12y - 6x 3 + I8y 2 + 3x 6 + 12x 3 y - 6xy 3 + 6x 2 y A . 

This polynomial is a sum of squares over R[x, y] . To see this, let a, /3, 7 G M be the 
roots of the polynomial u{x) — x 3, — 3x + 1. Then, a computation reveals that 

f = (x 3 + a 2 y + (3xy 2 - I) 2 + (x 3 + (3 2 y + jxy 2 - l) 2 + (x 3 + rfy + axy 2 - l) 2 . 

Using our techniques, we can construct from this representation one over Q; 

(x 3 + xy 2 + 3y/2 - l) 2 + (x 3 + 2y - l) 2 + (x 3 - xy 2 + 5y/2 - l) 2 

+ (2y~xy 2 ) 2 +3y 2 /2 + 3xV- 
This example will be revisited many times in the sequel to illustrate our proof. □ 

We believe that in Theorem [Lj] the field K may be replaced by any real algebraic 
extension of the rationals (thus giving a positive answer to Sturmfels' question); 
however, our techniques do not readily generalize to this situation. We shall discuss 
the obstructions throughout our presentation. 

The organization of this paper is as follows. In Section [2l we set up our no- 
tation and state a weaker (but still sufficient) version of our main theorem. Sec- 
tion [3] describes a matrix factorization for Vandermonde matrices. This construc- 
tion is applied in the subsequent section to reduce the problem to the case K = 
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Q{\/h, ■ ■ ■ , \/Tr) for positive integers Ik- Finally, the proof of Theorem 11.41 is com- 
pleted in Section [5J For simplicity of exposition, we shall focus on the polynomial 
version of our main theorem although it is an easy matter to translate the techniques 
to prove the more general Theorem 11.51 

We would like to thank T. Y. Lam, Bruce Reznick, and Bernd Sturmfels for 
interesting discussions about this problem. We also thank the anonymous referee 
for several suggestions that improved the exposition of this work. 



2. Preliminaries 

An equivalent definition of a totally real number field K is that it is a field 
generated by a root of an irreducible polynomial u(x) £ Q[x], all of whose zeroes 
are real. For instance, the field K = Q(a,/3, 7) = Q(a) arising in Example 11.71 is a 
totally real (Galois) extension of Q in this sense. A splitting field of u(x) (a Galois 
closure of K) is also totally real, so we lose no generality in assuming that K is 
a totally real Galois extension of Q. We will therefore assume from now on that 
K = Q(9) is Galois and that 9 is a real algebraic number, all of whose conjugates 
are also real. We set r = [K : Q] and let G be the Galois group Gal(K/Q). For the 
rest of our discussion, we will fix K = Q(9) with these parameters. 

We begin by stating a weaker formulation of Theorem 11.41 For the purposes of 
this work, a rational sum of squares is a linear combination of squares with positive 
rational coefficients. 

Theorem 2.1. Let K be a totally real number field that is Galois over Q. Then 
for any p € K[x\, . . . , x n ], the polynomial 

can be written as a rational sum of 2^ K: ^ +1 squares in Q[xi, . . . ,x n }. 

Remark 2.2. Elements of the form ^2 aeG (o~p) 2 are also sometimes called trace forms 
for the field extension K [T2] p. 217]. 

It is elementary, but important that this result implies Theorem II .41 



Proof of Theorem \1.4\ Let / = J3i=i-Pi Q[^i) ■ ■ ■ ,x n \ be a sum of squares with 
each pi € K[x\, . . . , x n \. Summing both sides of this equation over all actions of 
G = Gal(K/Q), we have 

The conclusions of Theorem 11.41 now follow immediately from Theorem 12.11 and 
Lagrange's four square theorem (every positive rational number is the sum of at 
most four squares). □ 



Remark 2.3. This averaging argument can also be found in the papers [3J [6] . 
We will focus our remaining efforts, therefore, on proving Theorem 12.11 
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3. Vandermonde Factorizations 



To prepare for the proof of Theorem 12. 1\ we describe a useful matrix factor- 
ization. It is inspired by Ilyusheckin's recent proof [12j that the discriminant of 
a symmetric matrix of indeterminates is a sum of squares, although it would not 
surprise us if the factorization was known much earlier. 

Let A — A T be an r x r symmetric matrix over a field F of characteristic not 
equal to 2, and let yi, . . . ,y r be the eigenvalues of A in an algebraic closure of F. 
Also, let V r be the Vandermonde matrix 



V r = 



1 

2/1 



1 

m 



l 



Vi 



2/2 



The matrix B = VrVj has as its (i, j)th entry the (i + j — 2)th Newton power sum 
of the eigenvalues of A: 

k=l 

Since the trace of A m is also the mth Newton power sum of the yk , it follows that 
we may write B — [tr(A i+J_2 )]^ . =1 6 prxr^ -y^ e nex ^ gj ve ano ther factorization of 
B in the form CC T . 

Let Eij be the r xr matrix with a 1 in the entry and O's elsewhere. A basis 
for r x r symmetric matrices is then given by the following ( r ^ X ) matrices: 

{E u : i = 1, . . . , r} U {(Eij + E 3l )/V2 :l<i<j< r}. 
For example, the "generic" symmetric 2x2 matrix 

(3.1) A = 

with entries in the field F = Q(xn, x±2, X22), is represented in this basis as 

- \/2 ■ X12 



X12 



X12 
X22 



xn 



X22 



1/V2 
1/V2 

This basis is useful since the inner product of two symmetric matrices P and Q 
with respect to this basis is simply tr(PQ), as one can easily check. 

Express the powers A m in terms of this basis and place the vectors of the co- 
efficients as rows of a matrix C. The entries of the r x ( r ^ X ) matrix C will be in 
F[\/2]. Our construction proves the formal identity 

(3.2) V r Vj = [tr(A«- 2 )][ J=1 = CC T . 

Example 3.1. With A given by the factorization reads: 

1 



1 


1 




" 1 


2/1 






2/2 _ 




1 


V2 _ 





1 

Xll 



1 

X22 





a/2 ■ X12 



Xll 

X22 
\/2-xi2 



Algebraically, this equation reflects the fact that for a 2 x 2 symmetric matrix A, 
tx(A) = xn + x 22 , 



tr(A 2 ) = ti(A) 2 - 2det(A) = x\ x 



b 22 



2x 



12- 



6 



CHRISTOPHER J. HILLAR 



□ 



In the next section, we will use the matrix factorization (|3.2j) to replace a Gram 
matrix over Q(yi, . . . , y r ) with one over a much smaller field. 



4. Symmetric matrices with prescribed characteristic polynomial 

Let K — Q{9) be totally real and Galois, and set a\, . . . , ay to be the elements 
of Gal(i4f/Q). Given p € K[x\, . . . , x n ], we may express it in the form 

r-1 

for elements qi € Q[xi, . . . , x ra ]. With this parameterization, the sum 



Eh 2 = EE?,m)' 

creG i=l \i=0 / 

appearing in the statement of Theorem 12.11 may be written succinctly as 
(4.1) 

1 ••• 1 

(Ji9 ■ ■ ■ cr r 9 



Qr 



{<Jlt 



(<T r ey 



1 ait 



1 



{(Trey- 1 





Qo 




q r -i 



Let V r be the Vandermonde matrix appearing in equation (|4.ip . We would like 
to construct a factorization as in (|3.2[) to replace the elements of K with numbers 
from Q(v2). To apply the techniques of Section[3l however, we must find anrxr 
symmetric matrix A whose eigenvalues are ai0, . . . , a r 9 (the roots of the minimal 
polynomial for 9 over Q). A necessary condition is that these numbers are all real, 
but we would like a converse. Unfortunately, a converse with matrices over Q is 
impossible. For the interested reader, we include a proof of this basic fact. 

Proposition 4.1. There is no rational, symmetric matrix with characteristic poly- 
nomial u{x) = x 2 — 3. 



Proof. We argue by way of contradiction. Let 



A = 



a b 
b c 



a,b,c € 



and suppose that 

u(x) = det(xl - A) = x 2 - {a + c)x + (ac - b 2 ). 

It follows that there are rational numbers a and b such that a 2 + b 2 = 3. Multiplying 
by a common denominator, one finds that there must be integer solutions u,v,w 
to the diophantine equation 



(4.2) 



3w z 



Recall from elementary number theory that a number n is the sum of two integral 
squares if and only if every prime p = 3 (mod 4) that appears in the prime factor- 
ization of n appears to an even power. This contradiction finishes the proof. □ 
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If we allow A to contain square roots of rational numbers, however, then there 
is always such a symmetric A. This is the content of a result of Fiedler [5]. We 
include his proof for completeness. 

Theorem 4.2 (Fiedler). Let u(x) G C[x] be monic of degree r and let b\, . . . , b r be 

distinct complex numbers such that u(b k ) / for each k. Set v(x) = Ilfe=i( :r — ^fc) 
and choose any complex numbers di, . . . ,d r and S that satisfy 

Sv'(b k )4-u(b k ) = 0, k = l,...,r. 

Let d — [di, . . . , d r ] T and B — diag(&!, . . . , b r ). Then the symmetric matrix 

A = B — Sdd T 

has characteristic polynomial equal to u{x). 

Proof. Applying the Sherman-Morrison formula pj p. 50] for the determinant of a 
rank 1 perturbation of a matrix, we have 

det(>J - A) = det(>J - B) + Sdet(xl - B)d T (xI - B)^d 

(4 - 3) = f[(x- bk )+sj2dt n (*-*). 

fe=l fe=l i=l,ijtk 

Since the monic polynomial det(x/ — A) and u{x) agree for x = b\, . . . , b r , it follows 
that they are equal. □ 

Remark 4.3. There are simpler, tridiagonal matrices which can replace the matrix 
A (see [28]); however, square roots are still necessary to construct them. 

The following corollary allows us to form a real symmetric matrix with charac- 
teristic polynomial equal to the minimal polynomial for 9 over Q. 

Corollary 4.4. // u(x) € Q[x] is monic of degree r and has r distinct real roots, 
then there are positive rational numbers l\,...,l r and a symmetric matrix A with 
entries in Q(\/^i", • ■ ■ , \ftr) such that the eigenvalues of A are the roots of u{x). 

Proof. Let b±, . . . ,fo r -i be rational numbers such that exactly one bi is (strictly) 
between consecutive roots of u(x), and let b r be a rational number either smaller 
than the least root of u(x) or greater than the largest root of u(x). Also, set 
5 S {— 1, 1} such that l k — Su(bk)/v'(bk) is positive for each k. The corollary now 
follows from Theorem 14.21 by setting d k — \flk~ for each k. □ 

Example 4.5. Consider the polynomial u{x) — x 3 — 3x + 1 from Example |J.7[ 
Choosing (b±, 62, 63) = (0, 1, 2) and 6 — 1, we have d = K/2/2,1, V6/2] T ana 



A = 



-1/2 -V2/2 -y/3/2 
-V2/2 -y/6/2 
-a/3/2 - v / 6/2 1/2 



One can easily verify that the characteristic polynomial of A is u(x). □ 

Combining Corollary |4 . 41 and the construction found in Section[31 we have proved 
the following theorem. 
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Theorem 4.6. Let K be a totally real Galois extension of Q and set r = [K : 
Then for any p £ K[xi, ■ ■ ■ , x n ], there are positive integers h, . . . , l r such that 



x n ] and C is an r x ( r t ) matrix 



crEG 



in which q is a vector of polynomials in Q[xi, 
with entries in F = Q(-\/Ii", . . . , \[T ri V%)- 

To illustrate the computations performed in the proof of Thcorcm l4.61 we present 
the following. 

Example 4.7. We continue with Example \4-5\ Let a £ R be the root of u(x) 
with a £ (1,2). Then, setting (3 = 2 — a — a 2 and 7 = a 2 — 2, we have u(x) = 
(x — a) (x — P)(x — 7). The Galois group of K — Q(a) is cyclic and is generated by 
the element a G G such that cr(a) — (3. If we letv = [x 3 + 2xy 2 — 1, — xy 2 , y — xy 2 ] T , 
then the factorization obtained by Theorem\4.6\ is given by 



1 


-1/2 


3/2 


T 


' 1 


-1/2 


3/2 


1 





2 




1 





2 


1 


1/2 


5/2 




1 


1/2 


5/2 





-1 


2 







-1 


2 





-V6/2 


V6/2 







-V6/2 


V6/2 





-V3 










-V3 






One can check that this factorization already produces the rational sum of squares 
representation we encountered in Example \1.7\ □ 

We note that when K is an arbitrary number field, Galois over Q, our approach 
still produces a result similar in spirit to Theorem 14.61 The only difference is that 
we must allow negative integers Ik in the statement. 



Theorem 4.8. Let K be a finite Galois extension o/Q and set r = [K 
for any p £ K [xi, . . . , x n ], there are integers l\, . . . , l r such that 



Then 



(ap) 2 = q T CC T q, 



in which q is a vector of polynomials in Q[xi, 
with entries in F = Q(^/h, ■ ■ ■ , VTr: v2). 



, x n ] and C is an r x C^ 1 ) matrix 



The following corollary is the closest we come to answering Sturmfels' question 
in the general case. It follows from applying Theorem 14.81 in the same way that 
Theorem 14.61 will be used below to prove Theorem 15. II 

Corollary 4.9. Let K be a finite extension of Q. If f £ Q[xi, . . . , x n ] is a sum 

of squares over K[x\, . . ., x n ], then it is a difference of two sums of squares over 
Q[xi, ...,x n ]. 

Example 4.10. Consider the degree 2 field extension K = Q(iv2)i which is the 
splitting field of u[x) = x 2 + 2. One can check that setting [b\, 62) = (0, 1), 8 = —1. 
and d = [v2, iVS\ T in Theorem \4-2\ produces the symmetric matrix 

. 2 iV& 

1V6 -2 
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It follows that the 2x2 Vandermonde matrix V2 as in j4-l\ ) satisfies V2V2 = CC T , 
in which 

[11 

[2 -2 2iV3 ' 

This calculation expresses the polynomial f — (x + i\/2y) 2 + (x — iy/2y) 2 as the 
difference 

f = (x + 2y) 2 + (x-2y) 2 -12y 2 . 

□ 

5. Proof of Theorem 12.11 

In this section, we complete the proof of our main theorem. The results so far 
show that if / is a sum of m squares in K[x\, . . . ,x n ] for a totally real field K, 
Galois over Q, then / is a sum of m ■ squares in L[xi, . . . ,x n ], where 

L = Q(y/h, ■ ■ ■ , v^r; v2) for some positive integers Ik- The proof of Theorem l2.1l is 
thus complete if we can show the following. 

Theorem 5.1. Let l-y, . . . , Z r+1 be positive integers and set L = Q(-\/Ji, . . . , y/l r +\). 
If f G Qt^ij ■ • • j x n] is a sum of s squares in L[x\, . . . , x n ], then f is a rational sum 
of at most s ■ 2 r+l squares in Q[x\, . . . ,x n ]. 

Proof. Let I be a positive integer and let L — F(yl) be a quadratic extension of a 
field F of characteristic 0. We shall prove: If / € F\xi, . . . ,x n ] is a rational sum 
of s squares in L[x\, . . . ,x n ], then / is a rational sum of at most 2s squares in 
F[x\, . . . , x n \. The theorem then follows by repeated application of this fact. 

If L = F, then there is nothing to prove. Otherwise, let a 6 Gal(L/F) be such 
that a{\fV) — —\/l, and let / € F[xi, . . . , x n ] be a sum of s squares in L[x\, . . . , x n ]: 

f= E^ = ^E(p? + (^) 2 ). 

i=l i=l 

It therefore suffices to prove that for fixed p E L[x\, . . . , x n ], the clement p 2 + (ap) 2 
is a rational sum of 2 squares. Finally, writing p = a + b^/l for a, b e F[x\, . . . , x n ], 
we have that 

(a + bVl) 2 + (a - bVlf = 2a 2 + 2lb 2 . 
This completes the proof. □ 
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